Object Classification using RGB-D data for Vision Aids Apples and Oranges
نویسندگان
چکیده
With the increased availability of cheap and reliable depth sensors, imaging systems can now use depth information to better detect and locate objects in a scene. New augmented reality (AR) systems, such as the Microsoft HoloLens, are now mounting depth sensors on their glasses to improve functionality. The motivation behind our project is to perform object recognition using RGB-D (color and depth) data for a vision aid being developed at Stanford. This vision aid consists of a pair of AR goggles with an Asus Xtion depth sensor mounted on top of it. We would like to use the RGB-D data from this sensor to train a classifier that can recognize household items from a database. Information on the classified object can then be relayed to a user through the vision aid. This system has the potential to help the visually impaired navigate and perform everyday tasks more efficiently. Our training data consists of an RGB-D data set from a team at the University of Washington. We used two different 3D descriptors to extract feature vectors that describe each frame of RGB-D data. The input features to our algorithm are these descriptor vectors. We then use SVM to train a classifier that can output the predicted class of new RGB-D images. Using test data we collected with our own experimental setup, we evaluated the effectiveness of our classifier. The best cross validation results gave 89% accuracy while using the SHOTCOLOR descriptors and a subset of tested items. Experimental results from this model had a prediction accuracy of 83% when presented with testing data obtained from our experimental setup.
منابع مشابه
Comparing Apples to Oranges: Common Trends and Thresholds in Anthropogenic and Environmental Pressures across Multiple Marine Ecosystems
Citation: Tam JC, Link JS, Large SI, Andrews K, Friedland KD, Gove J, Hazen E, Holsman K, Karnauskas M, Samhouri JF, Shuford R, Tomilieri N and Zador S (2017) Comparing Apples to Oranges: Common Trends and Thresholds in Anthropogenic and Environmental Pressures across Multiple Marine Ecosystems. Front. Mar. Sci. 4:282. doi: 10.3389/fmars.2017.00282 Comparing Apples to Oranges: Common Trends and...
متن کاملDetection and Classification of Multiple Objects using an RGB-D Sensor and Linear Spatial Pyramid Matching
This paper presents a complete system for multiple object detection and classification in a 3D scene using an RGB-D sensor such as the Microsoft Kinect sensor. Successful multiple object detection and classification are crucial features in many 3D computer vision applications. The main goal is making machines see and understand objects like humans do. To this goal, the new RGB-D sensors can be ...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کاملComparing apples and oranges: a randomised prospective study.
For many years the comparison of apples and oranges was thought to be impossible. Many authors use the analogy of the putative inability to compare apples and oranges as a means of scornfully reviewing the work of others. The titles of some recent publications 2 suggest an actual comparison of apples and oranges, but the authors do not, in fact, compare these two fruits. Our laboratory has been...
متن کاملObject-Based Classification of UltraCamD Imagery for Identification of Tree Species in the Mixed Planted Forest
This study is a contribution to assess the high resolution digital aerial imagery for semi-automatic analysis of tree species identification. To maximize the benefit of such data, the object-based classification was conducted in a mixed forest plantation. Two subsets of an UltraCam D image were geometrically corrected using aero-triangulation method. Some appropriate transformations were perfor...
متن کامل